-140- 



SEQUENCE LISTING 



(1) GENERAL INFORMATION : 

(i) APPLICANT: Role, Lorna W. 

(ii) TITLE OF INVENTION: SPLICE VARIANTS OF THE HgkEGULIN GENE , 
nARIA, 

AND USES THEREOF 
(iii) NUMBER OF SEQUENCES: 4 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Cooper & Dunham L! 

(B) STREET: 1185 Avenue of the Americas 

(C) CITY: New York 

(D) STATE: New York 

( E ) COUNTRY : U.S.A. 

(F) ZIP: 10036 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC constat ible 

(C) OPERATING SYSTEM: pfc-DOS/MS-DOS 

(D) SOFTWARE: Patent lis Release #1.0 



Version #1.30 



(vi) 



(viii) 



CURRENT APPLICATION E3ATA 

(A) APPLICATION NUMBER 

(B) FILING DATE: 

(C) CLASS IFICATI0N 

ATTORNEY /AGENT INFORMATION: 

(A) NAME: Whi/e , John P. 

(B) REGISTRATION NUMBER: 28,678 

(C) REFERENCE/DOCKET NUMBER: 4 6 83 9 -A 



(ix) TELECOMMUN/CATION INFORMATION: 

(A) TELEPHONE: 212-278-0400 

(B) TELEFAX: 212-391-0526 



(2) INFORMATION FOR SEQ ID NO : 1 : 



(i) SEQUENCE CHARACTERISTICS: 

) LENGTH: 3212 base pairs 
'(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

CGGATGCTGC TGCTACTGTC ACTTCTGCCG CTGCCGCTGT TGT7ACAGAT TTTGCTTTTG 
60 

CTCCTTCTAC CGCATGACAA TTGTTTTCCT CGCCTAAGCA y^ATACCAGCC TCAGATGCTC 
120 



AAGGTGAGAG TCTTGCCTTT CGCTCTGGGC TATTGGT: 
180 



r CA CTTAATCCGG TCAATTTGTT 



CGCTGCTCGT GGTTGTCTTT CTCCCCGCCC TCCT7CCCCC TGTTTTGTTT TGTTTCGCTT 
240 



GCTTTCGGGG GGACGCTCCT TCCCTCAGTC A( 
300 



fAAGAGCTG GAATTGCTTG AGAGGCGTAT 



AAGGAATTAT AAAAGTGGCC AGGAAACACC/ AGCGCAGTGA CTGCAGAGCT GCCCTTGGCT 
3 6'0 

TCGGCAAGGC AGCGTGAGCG G CAGAGGJ& C T CGGGCAGGGG GCGGGGGGTC TCCTTTTTCC 
420 

CGTGCGTTCC TCTTCTCCCA GTTCQfGATGA TGTTGCTGTT TCGGACCTCT CGCTGACTCC 
480 

TGCCCTGTGA TTTTTGCTGA GQfeCTGTGAC TGTTACTCCG TCTCTTTCTG TCTGTGTTTC 
540 



ACAGTAATGG ACTGTGATAG, 
600 



AGTTAAGGCC TTTTGGAGGT GAGCTGTGTC ACAGCTGATG 



CTTAAACATG TCTGAAGUAG GCACCGAGAC TTTCCCCAGC CCCTCGGCTC AGCTGAGCCC 
660 



TGATGCATCC CTTGG< 
720 



!GGGC TCCCGGCTGA GGAGAACATG CCGGGGCCCC ACAGAGAGGA 



CAGCAGGGTC CC^GGTGTGG CAGGCCTGGC CTCGACCTGC TGCGTGTGCC TGGAAGCAGA 
780 

GCGACTGAAG j&GCTGCCTCA ACTCTGAGAA GATCTGCATC GCCCCTATCC TGGCTTGCCT 
840 

GCTCAGCC/C TGCCTCTGCA TTGCTGGCCT C AAGTGGGT C TTTGTGGACA AGATTTTTGA 
900 

gtatgXctct CCTACACACC TTGACCCTGG GAGGATAGGA CAAGACCCAA GGAGCACTGT 
0 



^TrrTara nn^ryn^n-rn nnyn^n^nn TT^ r ^inTn TZ^^T^/" CCTTCCCCAT 




ACCTAGCCTT GAGAGCAAGG 
1080 

CAGGCCCTTC CTTCAGCCTT 
1140 

TGCCACACCG TCACTGTCAC 
1200 

AACAGAAACC AATCTCCAAA 
1260 

AAGTCATCTC ACAAAATGTG 
1320 

CTACATGGTT AAAGACCTCC 
1380 

TACTGGTGAT CGCTGCCAAA 
1440 

ATTTATGGAA GCTGAGGAAC 
1500 

TGCTCTTCTA GTAGTTGGCA 
1560 

GAAAAAGTTG CATGACCGCC 
1620 

CATGGCAAAT GGGCCACACC 
1680 

GTACGTTTCA AAAAACATAA 
1740 / 

GTTTTCCACA AGCCACTACA 
1800 / 

TAG C C A C AG C TGGAGTAATG 
1860 / 

CGTCAGCTCC TCAGTGGAGA 
1920 / 

CAATGGCAyr GGTGGGCCAA 
1980 / 

TGACTCZCTAC CGAGACTCTC 
204m 
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CTGAAGTGAC AGTGCAAACT 
CTCTCTACAA CCGCATCCTA 
CATCCTCCCT GGAGCCTACC 
CTGCTCCAAA ACTTTCCACT 
ACATAAAGCA GAAAGCCTTC 
CAAACCCTCC ACGATACCTA 
ACTACGTAAT GGCCAGCTTC 
TGTACCAGAA XcGGGTGCTG 
TCATGTGTGT GGTGGCCTAC 
TTCGGCAGAG CCTTCGCTCA 
AOCCCAACCC ACCACCAGAC 
TCTCCAGTGA ACGTGTCGTT 
CCTCAACAAC TCATCACTCC 
G C CAT AC CG A AAGCATTCTC 
ATAGCAGGCA CACCAGCCCA 
GGGAAGGCAA CAGCTTCCTC 
CTCACAGTGA AAGGTATGTC 



- > 

GACAGCTCGC pZ?GTGCCCTC 

GATGTCGG^T TGTGGTCCTC 

ACGGGATCTC AGGCACAAGC 

T^TACATCTA CAACTGGGAC 

TGTGTAAATG GGGGAGAGTG 

TGCAGGTGCC CAAATGAATT 

T AC AAG CAT C TTGGGATTGA 

ACCATAACTG GCATTTGCAT 

TGCAAAACCA AGAAGCAGAG 

GAGAGGAACA ACGTTATGAA 

AATGTCCAGC TGGTGAATCA 

GAGCGAGAAA CCGAGACCTC 

ATGACAGTCA CCCAGACGCC 

TCCGAAAGCC ACTCCGTGCT 

ACAGGGCCAC GAGGCCGCCT 

CGGCATGCAA GAG AG AC C C C 

TCAGCTATGA CCACACCAGC 



-143 - 



TCGCATGTCA CCCGTTGATT TCCACACTCC AACTTCTCCC AAGTCCCCTC/CATCTGAAAT 
2100 / 

GTCACCACCA GTTTCCAGCT TGACCATCTC CATCCCTTCG GTGGCGGTGA GTCCCTTTAT 
2160 / 

GGACGAGGAG AGACCGCTGC TGTTGGTGAC CCCACCACGG CTG^GTGAGA AGTACGACAA 
2220 / 

CCACCTTCAG CAATTCAACT CCTTCCACAA CAATCCCACC J^ATG AG AG C A ACAGTCTGCC 
2280 / 

ACCCAGTCCT CTGAGGATAG TGGAGGATGA AGAGTATGAG ACCACGCAGG AGTACGAACC 
2340 / 

AG C AC AGG AG CCTCCAAAGA AACTCACCAA CAGCOSGAGG GTGAAAAGAA CAAAGCCCAA 
2400 / 

TGGCCATATT TCCAGCAGGG TAGAAGTGGA C T G AC AC A AG C T C T C AG A GCACTAGCTC 
2460 / 

T GAG AG C G AA ACAGAAGATG AAAGAATAGG /TGAGGATACA CCATTTCTTA GCATACAAAA 
2520 / 

TCCCATGGCA ACCAGTCTGG AGCCAGCCCSC TG CAT AT C GG CTGGCTGAGA ACAGGACTAA 
2580 / 

CCCGGCAAAT CGCTTCTCCA CACCAGAAGA GTTGCAAGCA AGGTTGTCCA GTGTAATAGC 
2640 / 

TAACCAAGAC CCTATTGCTG TAT^AGACAT AAACAAAACA CAT AG ATT C A CATGTAAAAC 
2700 / 

TTTATTTTAT ATAATGAAGT ^TTCCACCTT TAAATTAAAC AATTTATTTT ATTTTAGCAA 
2760 T 

TTCCGCTGAT AGAAAACAAG AGTGGAAAAA GAAACTTTTA TAAATTAAGT ATACGTATGT 
2820 / 

ACAAATGTGT TATGTGCCAT ATGTAGCAAT TTTTTACAGT ATTTCCAAAA TGGGGAAAGA 
2880 / 

TATCAATGGT GCOTTTATGT TATGTTATGT TGAGAGCAAG TTTTGTACAG CTACAATGAT 
2940 / 

TGCTGTCCCG AgTATTTTG CAAAACCTTC TAGCCCTCAG TTGTTCTGGC TTTTTTGTGC 
3000 / 

ATTGCATTAT AATGACTGGA TGTATGATTT GCAAGAATTG CAGAAGTCCC CATTTGCTTG 
3060 7 

TTHTr^AATP rCPAGATCAA AAAGCCCTGT TATGGCACTC ACACCCTATC CACTTCACCA 
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3120 

GGAAAAAAAA AAAATCAAAA AAAAAAAAAA AAAAAAAAGA AAAGAAACJAG AAAAAAGAAA 
3180 

AGAAAAAGAA AAAAAAAGCT GAAAAAATAA AA 
3212 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1070 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEflf ID NO : 2 : 
Gly Cys Cys Cys Tyr Cys Hi/s Phe Cys Arg Cys Arg Cys Cys Tyr 



Arg 



1 



10 



15 



Phe Cys Phe Cys Ser Pfte Tyr Arg Met Thr lie Val Phe Leu Ala 



Xaa 



20 



25 



30 



Ala Asp Thr Ser Lfeu Arg Cys Ser Arg Xaa Glu Ser Cys Leu Ser 



Leu 



35 



40 



45 



Gly 



Trp Ala lie /Gly Ser Leu Asn Pro Val Asn Leu Phe Ala Ala Arg 



50 



55 



60 



Cys Leu/Ser Pro Arg Pro Pro Ser Pro Cys Phe Val Leu Phe Arg 



Leu 



65 



70 



75 



80 



Le/ Ser Gly Gly Arg Ser Phe Pro Gin Ser Glu Glu Leu Glu Leu 



Leu 



85 



90 



95 



Gin 



Glu Arg Arg lie Arg Asn Tyr Lys Ser Gly Gin Glu Thr Ar# Ala 

100 105 yio 



Xaa Leu Gin Ser Cys Pro Trp Leu Arg Gin Gly Ser /7a! Ser Gly 



Arg 



115 



120 



125 



Gly Leu Gly Gin Gly Ala Gly Gly Leu Leu Phef Pro Val Arg Ser 



Ser 



130 



135 



140 



Ser Pro Ser Ser Asp Asp Val Ala Val Sfir Asp Leu Ser Leu Thr 



Pro 
160 

Ser 



145 



150 



155 



Ala Leu Xaa Phe Leu Leu Ser Ala yVal Thr Val Thr Pro Ser Leu 



165 



170 



175 



Gly 



Val Cys Val Ser Gin Xaa Trr/ Thr Val lie Glu Leu Arg Pro Phe 



180 



185 



190 



Gly Glu Leu Cys His S/r Xaa Cys Leu Asn Met Ser Glu Val Gly 



Thr 



195 



200 



205 



Glu Thr Phe Pro JBer Pro Ser Ala Gin Leu Ser Pro Asp Ala Ser 



Leu 



210 



215 



220 



Gly Gly Leu/ Pro Ala Glu Glu Asn Met Pro Gly Pro His Arg Glu 



Asp 



240 



Cys 



225 



230 



235 



Ser Ayg Val Pro Gly Val Ala Gly Leu Ala Ser Thr Cys Cys Val 



245 



250 



255 



,eu Glu a: 



.rg 



Leu Lys Gly Cys Leu Asn Ser Glu Lys lie 



Cys 



260 



265 



lie Ala Pro lie Leu Ala Cys Leu Leu Ser Leu Gys Leu Cys lie 



Ala 



275 



280 



285 



Gly Leu Lys Trp Val Phe Val Asp Lys liar Phe Glu Tyr Asp Ser 



Pro 



290 



295 



300 



Thr His Leu Asp Pro Gly Arg lie (Ay Gin Asp Pro Arg Ser Thr 



305 



310 



315 



Asp Pro Thr Ala Leu Ser Ala /Trp Val Pro Ser Glu Val Tyr Ala 



Ser 



325 



330 



335 



Pro Phe Pro lie Pro Ser/Leu Glu Ser Lys Ala Glu Val Thr Val 



Gin 



340 



345 



350 



Thr Asp Ser Ser Ley Val Pro Ser Arg Pro Phe Leu Gin Pro Ser 



Leu 



355 



360 



365 



Tyr Asn Arg II/ Leu Asp Val Gly Leu Trp Ser Ser Ala Thr Pro 



Ser 



370 



375 



380 



Leu Ser Pr/ Ser Ser Leu Glu Pro Thr Thr Ala Ser Gin Ala Gin 



385 



390 



395 



Thr GVu Thr Asn Leu Gin Thr Ala Pro Lys Leu Ser Thr Ser Thr 



Ser 



405 



410 



415 



.r Thr Gly Thr Ser His Leu Thr T.vs Cys Asp lie T.ys Gin Lvs 



Ala 



425 



430 



Phe Cys Val Asn Gly Gly Glu Cys Tyr Met Val Lys Asp Led Pro 



Asn 



435 



440 



445 



Pro Pro Arg Tyr Leu Cys Arg Cys Pro Asn Glu Phje Thr Gly Asp 



Arg 



450 



455 



'6 0 



Glu 



480 



Cys Gin Asn Tyr Val Met Ala Ser Phe Tyr /Lys His Leu Gly lie 



465 



470 



475 



Thr 



Phe Met Glu Ala Glu Glu Leu Tyr GKi Lys Arg Val Leu Thr lie 



485 



490 



495 



Gly lie Cys lie Ala Leu Leu/Val Val Gly lie Met Cys Val Val 



Ala 



500 



505 



510 



Arg 



Tyr Cys Lys Thr Lys Lys Gin Arg Lys Lys Leu His Asp Arg Leu 



515 



520 



525 



Gly 



Gin Ser Leu Arg /er Glu Arg Asn Asn Val Met Asn Met Ala Asn 



530 



535 



540 



Pro His His/Pro Asn Pro Pro Pro Asp Asn Val Gin Leu Val Asn 



Gin 



560 



545 



550 



555 



Glu 



Tyr Va/L Ser Lys Asn lie lie Ser Ser Glu Arg Val Val Glu Arg 



565 



570 



575 



His 



.r Glu Thr Ser Phe Ser Thr Ser His Tyr Thr Ser Thr Thr His 



580 



585 



590 



is 



Ser Met Thr Val Thr Gin Thr Pro Ser His Ser Trp Ser Asn Gly 
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595 



600 



605 



Thr Glu Ser lie Leu Ser Glu Ser His Ser Val Leu Val/Ser Ser 



Ser 



610 



615 



620 



Val Glu Asn Ser Arg His Thr Ser Pro Thr Gly /Pro Arg Gly Arg 



Leu 



640 



625 



630 



35 



Asn Gly lie Gly Gly Pro Arg Glu Gly Afen Ser Phe Leu Arg His 



Ala 



645 



650 



655 



Arg Glu Thr Pro Asp Ser Tyr Arg Asp Ser Pro His Ser Glu Arg 



Tyr 



660 



665 



670 



Val Ser Ala Met Thr Thr Pro Ala Arg Met Ser Pro Val Asp Phe 



His 



675 



680 



685 



Thr Pro Thr Ser Pre/ Lys Ser Pro Pro Ser Glu Met Ser Pro Pro 



Val 



690 



695 



700 



Ser Ser Leu Th£ lie Ser lie Pro Ser Val Ala Val Ser Pro Phe 



Met 



720 



705 



710 



715 



Glu 



Asp Glu Olu Arg Pro Leu Leu Leu Val Thr Pro Pro Arg Leu Arg 



725 



730 



735 



Lys /Tyr Asp Asn His Leu Gin Gin Phe Asn Ser Phe His Asn Asn 



Pro 



740 



745 



750 



/ Thr His Glu Ser Asn Ser Leu Pro Pro Ser Pro Leu Arg lie Val 

Gly/ 

/ 7 55 — T&0 " 



Asp Glu Glu Tyr Glu Thr Thr Gin Glu Tyr Glu Pro Ala Gin Gl 



Pro 



770 



775 



780 



Pro Lys Lys Leu Thr Asn Ser Arg Arg Val Lys Arg Thr /Lys Pro 



785 



790 



795 



Gly His lie Ser Ser Arg Val Glu Val Asp Ser Ast> Thr Ser Ser 



Gin 



805 



810 



815 



Ser Thr Ser Ser Glu Ser Glu Thr Glu Asp J&lu Arg lie Gly Glu 



Asp 



820 



825 



830 



Thr Pro Phe Leu Ser lie Gin Asn Pre/ Met Ala Thr Ser Leu Glu 



Pro 



835 



840 



845 



Ala Ala Ala Tyr Arg Leu Ala Glu Asn Arg Thr Asn Pro Ala Asn 



Arg 



850 



855 



860 



Phe Ser Thr Pro Glu Glu heu Gin Ala Arg Leu Ser Ser Val lie 



865 



87/ 



875 



Asn Gin Asp Pro Ila/Ala Val Xaa Asp lie Asn Lys Thr His Arg 



Phe 



890 



895 



Thr Cys Lys Tht Leu Phe Tyr lie Met Lys Tyr Ser Thr Phe Lys 



Leu 



00 



905 



910 



Asn Asn /eu Phe Tyr Phe Ser Asn Ser Ala Asp Arg Lys Gin Glu 



Trp 



915 



920 



925 



Ly/ Lys Lys Leu Leu Xaa lie Lys Tyr Thr Tyr Val Gin Met Cys 



Tyr 
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930 



935 



940 



Val Pro Tyr Val Ala lie Phe Tyr Ser lie Ser Lys 



Gly Lys 



Asp 
960 

Thr 



945 



950 



lie Asn Gly Ala Phe Met Leu Cys Tyr Val Glu Ser Lys Phe Cys 

965 975 



Ala Thr Met He Ala Val Pro Xaa x/r Phe Ala Lys Pro Ser Ser 



Pro 



980 



985 



990 



Gin Leu Phe Trp Leu Phe Cy£ Ala Leu His Tyr Asn Asp Trp Met 



Tyr 



995 



1000 



1005 



Asp Leu Gin Glu Leu/Gin Lys Ser Pro Phe Ala Cys Cys Gly He 



Pro 



1010 



1015 



1020 



Arg Ser Lys S£r Pro Val Met Ala Leu Thr Pro Tyr Pro Leu His 



Gin 
1040 

Arg 



1025 



1030 



1035 



Glu Ly§/Lys Lys He Lys Lys Lys Lys Lys Lys Lys Arg Lys Glu 

1045 1050 1055 



GZu Lys Arg Lys Glu Lys Glu Lys Lys Ser Xaa Lys Asn Lys 

1060 1065 1070 

{2/ INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1351 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
{ TnpnT.nnv ■ 1 i n^ar 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : / 

CGGCCTGTAA GATGCTGTAT CATTTGGTTG GGGGGGCCTC TGCGTGGTAA TC2GACCGTGA 
60 / 

GAGCGGCCAG GCCTTCTTCT GGAGGTGAGC CGATGGAGAT TTATTCCCCA GACATGTCTG 
120 / 

AGGTCGCCGC CGAGAGGTCC TCCAGCCCCT CCACTCAGCT GAGTG^AGAC CCATCTCTTG 
180 / 

ATGGGCTTCC GGCAGCAGAA GACATGCCAG AGCCCCAGAC TOAAGATGGG AGAACCCCTG 
240 / 

GACTCGTGGG CCTGGCCGTG CCCTGCTGTG CGTGCCTAGR AGCTGAGCGC CTGAGAGGTT 
300 / 

GCCTCAACTC AG AG AAAAT C TGCATTGTCC CCATCOTGGC TTGCCTGGTC AGCCTCTGCC 
360 / 

TCTGCATCGC CGGCCTCAAG TGGGTATTTG TGC^ACAAGAT CTTTGAATAT GACTCTCCTA 
420 / 

CTCACCTTGA CCCTGGGGGG TTAGGCCAGg/aCCCTATTAT TTCTCTGGAC GCAACTGCTG 
480 / 

CCTCAGCTGT GTGGGTGTCG T C T G AG G?C AT ACACTTCACC TGTCTCTAGG GCTCAATCTG 
540 Y 

AAAGTGAGGT TCAAGTTACA GTGCAAGGTG ACAAGGCTGT TGTCTCCTTT GAACCATCAG 
600 / 

CGGCACCGAC ACCGAAGAAT /CGTATTTTTG CCTTTTCTTT CTTGCCGTCC ACTGCGCCAT 
660 / 

CCTTCCCTTC ACCCACCJCGG AACCCTGAGG TGAGAACGCC CAAGTCAGCA ACTCAGCCAC 
720 / 

AAACAACAGA AACTAATCTC CAAACTGCTC CTAAACTTTC TACATCTACA TCCACCACTG 
780 / 

GGACAAGCCA ^CTTGTAAAA TGTGCGGAGA AGGAGAAAAC TTTCTGTGTG AATGGAGGGG 
840 / 

AGTGCTT^AT GGTGAAAGAC CTTTCAAACC CCTCGAGATA CTTGTGCAAA GGCGGAGGAG 
900/ 

CTC^ACCAGA AGAGAGTGCT GACCATAACC GGCATCTGCA TCGCCCTCCT TGTGGTCGGC 
/ 960 

ATCATGTGTG TGGTGG CCTA CTGCA AAACC 7\J\(^7\T\T\ n T' rin ^ft* A flfl ^T H^THTY^^^ 

HT2D — ' — 
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CTTCGGCAGA GCCTTCGGTC TGAACGAAAC AATACGATGA AC AT TG C C AA 
1080 

CATCCTAACC CACCCCCCGA GAATGTCCAG CTGGTGAATC AATACGTATC 
1140 



CTCAC 



CGTC 



ATCTCCAGTG AGCATATTGT TGAGAGAGAA GCAGAGACAT CCTTTTCCXC CAGTCACTAT 
1200 

ACTTCCACAG CCCATCACTC CACTACTGTC ACCCAGACTC CTAG OtACAG CTGGAGCAAC 
1260 

GGACACACTG AAAGCATCCT TTCCGAAAGC CACTCTGTAA TQ6TGATGTC ATCCGTAGAA 
1320 

AACAGTAGGC ACAGCAGCCC AACTGGGGCC G 
1351 

(2) INFORMATION FOR SEQ ID NO : 4 : 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 449 amino acid 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



Xaa 



Glu 



Ser 



Ala 




(xi) SEQUENCE DESCRIPTION': SEQ ID NO : 4 : 
Ala Cys Lys Met Leu T/r His Leu Val Gly Gly Ala Ser Ala Trp 
1 



Trp Thr Val Arg Ala Ala Arg Pro Ser Ser Gly Gly Glu Pro Met 



lie Tyr Ser Pro Asp Met Ser Glu Val Ala Ala Glu Arg Ser Ser 



Pro Ser Jthr Gin Leu Ser Ala Asp Pro Ser Leu Asp Gly Leu Pro 
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Ala Glu Asp Met Pro Glu Pro Gin Thr Glu Asp Gly Arg Thr/Pro 



Gly 



65 



70 



75 



80 



Leu Val Gly Leu Ala Val Pro Cys Cys Ala Cys Leu/6lu Ala Glu 
Ar 9 -- 90 / 95 



85 



Leu 



Arg Gly Cys Leu Asn Ser Glu Lys Ileydys He Val Pro He 



Leu 



100 



105 



110 



Ala Cys Leu Val Ser Leu Cys Leu c/s He Ala Gly Leu Lys Trp 



Val 



115 



120 



125 



Phe Val Asp Lys He Phe Glu/Tyr Asp Ser Pro Thr His Leu Asp 



Pro 



130 



140 



Pro He He Ser Leu Asp Ala Thr Ala 



Gly Gly Leu Gly Gin A&p 

145 / 150 155 



Arg 



Ser Ala Val Trp/V* 1 Ser Ser Glu Ala Tyr Thr Ser Pr ° Val 

165 170 175 



Ala Gin Se/Glu Ser Glu Val Gin Val Thr Val Gin Gly Asp Lys 



Ala 



180 



185 



190 



Val VaQ Ser Phe Glu Pro Ser Ala Ala Pro Thr Pro Lys Asn Arg 



He 



195 



200 



205 



Ala Phe Ser Phe Leu Pro Ser Thr Ala Pro Ser Phe Pro Ser 



Pro 



210 



215 



220 



Thr Arg Asn Pro Glu Val^Ar< 



Ala Thr Hi n Pro 



In 
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225 



230 



235 



240 



Thr 



Thr Thr Glu Thr Asn Leu Gin Thr Ala Pro Lys Leu Sfer Thr Ser 



245 



250 



255 



260 



265 



Thr Phe Cys Val Asn Gly Gly Glu Cy/ Phe Met Val Lys Asp Leu 



Ser 



275 



280 



285 



Asn Pro Ser Arg Tyr Leu Cys /ys Gly Gly Gly Ala Val Pro Glu 



Glu 



290 



29 



300 



Ser Ala Asp His Asn Ar/ His Leu His Arg Pro Pro Cys Gly Arg 



Ala 



305 

His Val Cys Gly c/iy 

325 



10 



315 



Leu Leu Gin Asn Gin Glu Thr Ala Glu Lys 

330 335 



340 



345 



355 



360 



PjTo 



Ala Gly Glu Ser lie Arg He Xaa Lys Arg His Leu Gin Xaa 



Ala 



370 



375 



380 



Tyr Cys Xaa Glu Arg Ser Arg Asp He Leu Phe His Gin Ser Leu 

T[QO . ■ 3-9-5 — 1 



385 



4 0. 



Gin 
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Phe His Ser Pro Ser Leu His Tyr Cys His Pro 



405 



Cys 



Trp 




Xaa Pro 
415 



Leu Glu Gin Arg Thr His Xaa 



420 



is Pro Phe Arg Lys Pro Leu 



430 



Asn Arg Asp Val^fle Arg Arg Lys Gin Xaa Ala Gin Gin Pro Asn 



